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Abstract. We have developed a method 'time- frequency (TF) clustering' to find 
the burst gravitational waves for TAMA data analysis. TF clustering method on 
sonogram (spectrogram) shows some characteristics of short duration signal. Using 
parameters which represent the cluster shape, we can efficiently identify some predicted 
gravitational wave forms [1] [2] and can exclude typical unstable spike like noises due to 
detector instruments. The requirement of some parameters of cluster achieved roughly 
50% average efficiency for injected DFM waveforms of hrss ~ 2 x 10~^^ for type I burst. 
Also the reduction for signal by spike noises are more than one order improvement for 
the SNR>100. 
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1. Introduction 

It is expected that stellar core collapses of supernova will radiate the burst gravitational 
waves, which frequency is suitable to detect with ground-based gravitational wave 
detectors. The predicted burst gravitational waves have a some peculiar characteristics 
: Type I) big narrow spike of ~1 msec and following 10-15 msec decay ringing, Type II) 
a few msec spikes iterate a few times with ~10 msec interval. Type III) broad excess of 
~5 msec with fine structure of ~1 msec. The predicted burst gravitational waves have 
a duration and tail of signals. Since the tail structure and frequency spread is not same 
as popular spike noises which caused by the detector instruments, it will be possible 
to differentiate the gravitational waves from noises. However, it is hard to predict the 
gravitational wave forms in analytically or hard to fit with a few parameters. 

To identify such a signal, we have developed a new method by examining 'clustering' 
of signal powers in time-frequency domain [3]. We study using TAMA observation data 
and numerical predicted gravitational waveforms by Dimmelmeier at al.[l][2] 

In some other time-frequency domain analysis, which especially use 'cluster', were 
studied as reference [4], to find efficient power filter window. Our practical issue is 
employment much more parameters and find efficient combinations of parameters. 

2. TAMA data and preprocess 

TAMA300 is a 300 m baseline laser interferometric gravitational wave detector in 
Mitaka, Tokyo, Japan[5]. The interferometer is Fabry- Perot-Michelson type with a 
power recycling technique. The strain sensitivity of the detector is 1.5 x 10~^^[/v^Hz] 
around IkHz, and the integral observation time is more than 3000 hours [5] [6] [7]. The 
typical strain sensitivity of TAMA detector in 9th data taking (DT9) is displayed in 
figure 3 of the reference [7]. Gravitational waves from stellar-core collapse of supernovae, 
or quasi-normal mode oscillation of black holes, has the typical time scale 10~100 msec. 
It corresponds to the good sensitive observation band lOOHz^lkHz of ground-based 
gravitational wave detectors as TAMA300. 

2.1. raw data and simulation data 

The time series signal from the TAMA interferometer is digitized by analogue-to-digital 
converter of 16 bit depth. The sampling interval is 50/xsec(=20kHz sampling). The raw 
data as time series signal v{t) is stored in frame format [8]. One data file consists from 
20 frames, and each frame is 3.2768 sec time series signal. For the convenient of the 
event search analysis, we repack raw data frames into new frame of 52.4288 sec data for 
2^° samples. For this analysis, we use part of DTD real data which has good stableness 
and better sensitivity. 

To investigate the search performance for burst gravitational wave, we used 
predicted burst gravitational waves by Dimmelmeier et al.[l][2] (DFM waveforms). We 
inject DFM waveform by software (off-line injection) on TAMA data. Since this signal 
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v{t) is given after the detector response, electric servo and whitening filter, we convert 
DFM waveform from strain h{t) to v{t) using transfer function, and embed to raw data. 

Also gaussian noise simulation data is used. Since real data include many unstable 
spikes or glitch noises which have short time duration, these will make spurious signals 
of gravitational waves. Gaussian noise simulation data need to evaluate the statistical 
fluctuation of stable noises. 



2.2. Line removal 

Before transforming the time series data to time-frequency domain, we remove the line 
noises which come from AC power line and its harmonics, mechanical resonances of 
suspension, etc. The detail is given in the references [7] [9]. 



2.3. Short-FFT (sonogram) 

We split time series data to 3.2 msec chunks. Time shift of next chunk is 0.8 msec, thus 
the neighbor chunks overlap 2.4 msec. The figure 1 displays this process. For each of 
the chunks, signal powers are calculated by using FFT (short-time Fourier transform, 
SFFT) with a Hanning window. The frequency resolution in the SFFTs is 312.5Hz. 
We dispose these SFFTs in time order, which is called 'sonogram' in time-frequency 
domain. One bin of data of time-frequency domain is displayed as 'pixel'. Each 'pixel' 
represents the signal power at a given time and a frequency. SFFT spectrum at each 
frequency band is normalized using average noise. Thus the sonogram will appear as 
flat pixels with gaussian noise. The figure 2 and 3 are examples of sonogram of TAMA 
data around spike like noise, and typical DFM waveform injected in the TAMA300 data. 
Sonogram illustrate the pixel magnitude using color. 
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Figure 1. Short-FFT (sonogram) process schematic 



3. Clustering in time-frequency domain 

In time-frequency domain sonogram, we can find a closed pixels of which have higher 
magnitude in the figure 3 as 'cluster' of signal. 
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Figure 2. sonogram example: TAMA spike like noise 

Spike noises spread entire the frequency band, but concentrate in short time duration. 
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Figure 3. sonogram example: burst gravitational wave (DFM waveform) injected 
TAMA data 

Typical DFM waves not widely spread in frequency, but have a tail along the time 
axes. 



To identify and define a cluster are as follows: 1) We find all pixel which magnitude 
exceed signal-to- noise ratio SNR > 3. 2) We assume that a local maximum pixel is a 
'peak' of the cluster. The peak specified with its time to and frequency /q. 3) If one 
of a 8 pixels around the peak of the cluster exceed the cluster threshold, this pixel is 
'connected' pixel. The cluster threshold is settled as square root of SNR of the peak 
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pixel. 4) We check the neighbor of connected pixels, again. 5) If no more connected pixel 
cannot be found, we define the peak and these connected pixels as one 'TF-cluster'. 6) 
We will iterate 2)-5) exclude the pixel which belongs to previous cluster, for other local 
peak. 

3.1. Characteristic parameters of clusters 

When the cluster found, we calculate some parameters which shows cluster character. 
Each cluster can be characterized by the absolute power, spread area, shape of power 
distribution et al. Figure 4 displays the schematic of projection of the cluster to each 
axes. We can project the cluster by the numbers of pixels, or integrated magnitude of 
pixels along the orthogonal axes. We employ following parameters to quantitate the 
shape of the projection; the number of pixels in the cluster for area 5* , the sum of a 
magnitude of pixels for cluster volume V, tNs as A^-th moment of projection of area 
(=the numbers of pixels) along the time axes, tNv as A^-th moment of projection of 
magnitude along the time axes, fNs as same as a tNs along the frequency axes, fNv 
as same as a tNv along the frequency axes. 

{tls, /Is) is a mean center of the cluster area, and {tlv, flv) is a center of 
magnitude of the cluster. Higher orders tNs corresponds to the standard deviation, 
skewness et al. 




Figure 4. TF-cluster and its projection to each axes. 
The pixels inside white border is recognized as one cluster. 



4. Event selection criteria 

Some combinations of the parameters makes possible to separate gravitational wave 
signal from many noises. The figure 5 and 6 shows the example of two combination 
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which efficiently separate the gaussian noise from the DFM wave forms. 

In case of figure 5 (/Is - /2s scatter plot), injected DFM wave forms concentrate 
within the box of blue solid line. However, many TAMA noises spread wider region. 
Most of TAMA noises are spike noises. One can see that the DFM waveforms can be 
efficiently separated from spike noises in the figure. On the other, the figure 6 shows the 
separation from gaussian noises. Gaussian noises will make spurious clusters accidentally 
according to the statistical fluctuations. Almost of them have small size, and parameters 
correlation is different from spike noise of gravitational waves. In this tls/S - fls/S 
scatter plot, both spike noises and DFM waveforms concentrate in the blue circle. 

In this study, we require all the criteria in the table 1 for the selection of burst 
gravitational wave candidates. 

Table 1. Selection criteria of cluster parameters for burst gravitational waves 



parameters brief meaning 



criteria 

(I'nit is [])ixc4s] exee])t for fo) 



S area size of the cluster 

fo frequency at the peak of the cluster 



S>4: 

fo < 1250 [Hz] 



For the frequency projection of the cluster; 

/Is distance from center to the to the peak 

/2s standard deviation around /Is 

fiv kurtosis around power weighted center flv 



-2.0 < /Is < 2.0 
/2s < 5.0 
fAv < 6.0 



For the time projection of the cluster; 

tlv distance from weighted center to the to the peak 

t2v standard deviation around tlv 



-1.5 < tlv < 1.5 
t2v < 3.0 



In combination; 
t2v, S 



't2v 



> 0.04 



tls', t2s\ S 



VtT^~+t2^ 



< 0.15 



4-1- Efficiency for burst gravitational waves 

With the criteria in the table 1, we estimate the efficiency by using Monte-Carlo 
simulation with random injection of the DFM gravitational waves in TAMA data. The 
figure fig:distVSeff shows the efficiency for TF-clusters of source distance from the earth. 
The efficiency depend on the type of burst waveform. Also the figure fig:hrssVSeff shows 
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Figure 5. Scatter plot of cluster parameters /Is - /2s 

Red cross points is noises in real data. Others are injected gravitational waves (DFM 
waveforms) in simulation data. 
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Figure 6. Scatter plot of cluster parameters ils/5 - fls/S 

Cluster due to gaussian noise fluctuation is spread uniformly in this figure. 



the efficiency for hrss '■ root square sum of strain amplitude. The selection criteria for 
clusters which peak SNR>3 achieved 50% average efficiency for injected DFM waveforms 
at 350 pc {Kss ~ 2 X 10^^^) for type I burst [1], and at 170 pc for all average. We take 
note that the efficiency looks higher than previous our analysis[7], because we set lower 
SNR threshold for the comparison of clusters. If we wish to derive better upper limit of 
event search, we must set more higher threshold of SNR to reject small spurious noise. 
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Figure 7. TF-cluster selection efficiency for injected DFM waveforms (source 
distance) 
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Figure 8. TF-cluster selection efficiency for injected DFM waveforms (hrss) 
4-2. Noise reduction 

TF-cluster selection criteria can reject typical noises efficiently in real data. Using a 
part of TAMA DT9 data (1.26 x lO^sec), we compare the SNR histogram of 'peak' 
before and after cluster parameter selection. The reduction for spurious event clusters 
by spike noises are more than one order improvement for the SNR>100. The reduction 
is efficient for higher SNR clusters for SNR>200. Because that TF-cluster method is 
efficient when the waveform is clear. 
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Figure 9. Histogram of SNR for real data 

Histogram of cluster SNR is shown for a part of TAMA DT9 data. Green dashed line 
is all found cluster in TF domain. Red line is after the selection by criteria in the table 
1. 



5. Summary and future plan 

We recently practice the newer search algorithm as TF-cluster using many cluster 
parameters. The method is efficient to select gravitational event and reject noise fake, 
especially in larger SNR case. Since the TF-cluster method is not require the exact 
template of wave form, it is useful to identify the burst gravitational waves which we 
believe only the typical characteristics is knows, or which predicted numerically without 
analytic waveform. 

In the sonogram of this analysis, we employed SFFT, which has uniform time 
and frequency resolution. However, this is not optimal time-frequency resolution 
in an uncertainty principle of short wave packet. According to the complemental 
relation between the time and the frequency, we can improve time resolution in high 
frequency band, can improve frequency resolution in lower frequency band instead of 
time resolution. We began to employ with wavelet transform to achieve such resolution 
and to make more clear appearance of burst gravitational waves. 
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